Image processing apparatus, image processing method, and storage medium

ABSTRACT

An image processing apparatus including: a determination unit configured to determine a similar form similar to scanned image for each of a plurality of scanned images; and a display control unit configured to perform control such that a list of the plurality of scanned images is displayed on a display unit, wherein scanned images of which the determined similar form are same are displayed together in the list.

BACKGROUND OF THE DISCLOSURE Field of the Disclosure

The present disclosure relates to a process of displaying files ofscanned images.

Description of the Related Art

As a document management method, there is a system that managesdocuments by converting scanned images obtained by reading the documentswith a scanner into files of a predetermined format, and setting thefile names of these scanned images based on the contents described inthe respective documents. In a case where a plurality of documents to becomputerized are scanned together, it is conceivable to present thefiles of the plurality of scanned images in a list form to the user. Bypresenting the files in a list form to the user, the user can select adesired file from the list and perform operations on the file.

Japanese Patent Laid-Open No. H08-221558 discloses a method involvingdetermining the types of read documents by comparing their image datawith predefined template document information, and displaying thedocuments such that those of the same type are displayed together.

However, the method of Japanese Patent Laid-Open No. H08-221558 cannotdisplay documents of the same type together if their image data do notmatch the template document information and therefore their documenttype cannot be determined. This may increase the burden on the user foroperations such as finding the file of a desired scanned image from alist of files.

SUMMARY OF THE DISCLOSURE

An object of the present disclosure is to ease the burden on a user foroperations for configuring settings of scanned images.

An image processing apparatus of the present disclosure includes: adetermination unit configured to determine a similar form similar toscanned image for each of a plurality of scanned images; and a displaycontrol unit configured to perform control such that a list of theplurality of scanned images is displayed on a display unit, whereinscanned images of which the determined similar form are same aredisplayed together in the list.

Further features of the present disclosure will become apparent from thefollowing description of exemplary embodiments with reference to theattached drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a diagram illustrating an entire configuration of an imageprocessing system;

FIG. 2 is a block diagram illustrating a hardware configuration of anMFP;

FIG. 3 is a block diagram illustrating hardware configurations of an MFPcooperative server and a storage server;

FIG. 4 is a block diagram illustrating a functional configuration of theimage processing system;

FIG. 5 is a sequence chart illustrating a flow of processing in theentire image processing system;

FIG. 6 is a diagram illustrating an example of a UI screen of a mainmenu;

FIG. 7 is a diagram illustrating an example of a login screen;

FIG. 8 is a diagram illustrating an example of a scan setting screen;

FIG. 9 is a diagram illustrating an example of a storage save screen;

FIGS. 10A and 10B are diagrams illustrating an example of a file namesetting screen;

FIG. 11 is a flowchart illustrating details of an image analysisprocess;

FIGS. 12A and 12B are diagrams illustrating examples of the datastructure of analysis result information;

FIGS. 13A and 13B are diagrams illustrating an example of a group namelist;

FIG. 14 is a flowchart illustrating details of a group name updateprocess;

FIG. 15 is a diagram for explaining a process of comparing characterstrings;

FIG. 16 is a flowchart illustrating details of a group name generationprocess;

FIGS. 17A and 17B are diagrams illustrating an example of the group namelist;

FIGS. 18A and 18B are diagrams illustrating an example of a new form anda similar form similar to the new form;

FIGS. 19A and 19B are diagrams illustrating an example of characterrecognition results of scanned images;

FIG. 20 is a flowchart illustrating details of a group name generationprocess;

FIG. 21 is a diagram illustrating an example of data generated by a fileinformation input process; and

FIG. 22 is a diagram for explaining a process of comparing characterstrings.

DESCRIPTION OF THE EMBODIMENTS

Embodiments of the technique of the present disclosure will be describedbelow with reference the drawings. Note that the following embodimentsdo not limit the invention according to the claims, and not all thecombinations of the features described in the embodiments arenecessarily essential for the solution provided by the invention.

Embodiment 1

[System Configuration]

FIG. 1 is a diagram illustrating an entire configuration of an imageprocessing system 100 according to Embodiment 1. The image processingsystem 100 includes a multifunction peripheral (MFP) 110, an MFPcooperative server 120, and a storage server 130. The MFP 110 iscommunicatively connected to a server that provides various servicesover the Internet via a local area network (LAN).

FIG. 1 is a diagram illustrating an entire configuration of an imageprocessing system 100 according to Embodiment 1. The image processingsystem 100 includes a multifunction peripheral (MFP) 110, and serverapparatuses 120 and 130 that provide cloud services over the Internet.The MFP 110 is communicatively connected to the server apparatuses 120and 130 via the Internet.

The MFP 110 is an example of an image processing apparatus having a scanfunction. The MFP 110 is a multifunction peripheral having a pluralityof functions such as a print function and a BOX save function inaddition to the scan function. Both of the server apparatuses 120 and130 are an example of an image processing apparatus that provides acloud service. The server apparatus 120 in the present embodimentprovides a cloud service that performs an image analysis on scannedimages received from the MFP 110 and transfers a request from the MFP110 to the server apparatus 130 providing another service.

Hereinafter, the cloud service provided by the server apparatus 120 willbe referred to as “MFP cooperative service”. The server apparatus 130provides a cloud service that saves files sent thereto via the Internetand provides a saved file in response to a request from a web browser ona mobile terminal (not illustrated) or the like (hereinafter referred toas “storage service”). In the present embodiment, the server apparatus120, which provides the MFP cooperative service, will be referred to as“MFP cooperative server”, and the server apparatus 130, which providesthe storage service, will be referred to as “storage server”.

The configuration of the image processing system 100 illustrated in FIG.1 is an example, and the configuration is not limited to this. Forexample, the MFP 110 may also have the function of the MFP cooperativeserver 120. Also, the MFP cooperative server 120 may be connected to theMFP 110 not via the Internet but via a local area network (LAN). Also,the storage server 130 may be replaced with a mail server thatimplements a mail distribution service, and used in a situation ofattaching a scanned image of a document to mail and transmitting it.

[Hardware Configuration of MFP]

FIG. 2 is a block diagram illustrating a hardware configuration of theMFP 110. The MFP 110 has a control unit 210, an operation unit 220, aprinter unit 221, a scanner unit 222, and a modem 223. The control unit210 includes components 211 to 219 below and controls the operation ofthe entire MFP 110. The CPU 211 reads out and executes various controlprograms stored in the ROM 212 (programs corresponding to the variousfunctions illustrated in the functional configuration diagram to bementioned later). The RAM 213 is used as temporary storage areas such asa main memory and a work area for the CPU 211. Note that in the presentembodiment, a single CPU 211 uses a single memory (RAM 213 or HDD 214)to execute processes illustrated in flowcharts to be mentioned later.However, the configuration is not limited to this. For example, aplurality of CPUs and a plurality of RAMs or HDDs may be caused tocooperate with each other to execute the processes. The HDD 214 is alarge-capacity storage unit that stores image data and various programs.The operation unit I/F 215 is an interface that connects the operationunit 220 and the control unit 210.

The operation unit 220 includes a touchscreen that functions also as adisplay unit, a keyboard, and the like, and accepts user operations,inputs, and instructions. Note that touch operations on the touchscreeninclude operations with a person's finger and operations with a stylus.The printer I/F 216 is an interface that connects the printer unit 221and the control unit 210. Image data to be printed is transferred fromthe control unit 210 to the printer unit 221 via the printer I/F 216,and printed onto a print medium such as paper. The scanner I/F 217 is aninterface that connects the scanner unit 222 and the control unit 210.The scanner unit 222 generates scanned image data by optically reading adocument set on platen glass or auto document feeder (ADF) notillustrated, and inputs the scanned image data to the control unit 210via the scanner I/F 217. The scanned image data generated by the scannerunit 222 can be printed (copied and outputted) by the printer unit 221,saved to the HDD 214, or transmitted in the form of a file to anexternal apparatus such as the MFP cooperative server 120 via a LAN. Themodem I/F 218 is an interface that connects the modem 223 and thecontrol unit 210. The modem 223 performs facsimile communication ofimage data with a facsimile (not illustrated) on a PSTN. The network I/F219 is an interface that connects the control unit 210 (MFP 110) to aLAN. The MFP 110 transmits scanned image data to the MFP cooperativeserver 120 or receives various pieces of data from the MFP cooperativeserver 120 by using the network I/F 219. The hardware configuration ofthe MFP 110 described above is an example and may include anotherconfiguration(s) or omit part of the configuration as necessary.

[Hardware Configurations of Servers]

FIG. 3 is a block diagram illustrating hardware configurations of theMFP cooperative server 120 and the storage server 130. The MFPcooperative server 120 and the storage server 130 have a common hardwareconfiguration, and include a CPU 311, a ROM 312, a RAM 313, an HDD 314,and a network I/F 315. The CPU 311 reads out a control program stored inthe ROM 312 and executes various processes to control the entireoperation. The RAM 313 is used as temporary storage areas such as a mainmemory and a work area for the CPU 311. The HDD 314 is a large-capacitystorage unit that stores image data and various programs. The networkI/F 315 is an interface that connects the control unit 310 to theInternet. The MFP cooperative server 120 and the storage server 130receive requests for various processes from other apparatuses (such asthe MFP 110) and return process results corresponding to these requestsvia the network I/F 315.

[Functional Configuration of Image Processing System]

FIG. 4 is a block diagram illustrating a functional configuration of theimage processing system 100 according to the present embodiment.Functional configurations corresponding to the roles of the MFP 110 andthe MFP cooperative server 120 and storage server 130 forming the imageprocessing system 100 will be described below in turn. Note that thedescription will focus on, of the various functions of each apparatus,functions related to processes from scanning a document to computerizeit (convert it into a file) to saving the computerized document to thestorage server 130.

[Functional Configuration of MFP]

The function modules of the MFP 110 are generally classified into twofunction modules, namely, a native function module 410 and an additionalfunction module 420. The native function module 410 is an applicationnormally included in the MFP 110 whereas the additional function module420 is an application additionally installed in the MFP 110. Theadditional function module 420 is an application based on Java(registered trademark), which enables its functions to be easily addedto the MFP 110. Note that other additional applications not illustratedmay be installed in the MFP 110.

The native function module 410 has a scan execution unit 411 and ascanned image management unit 412. The additional function module 420has a display control unit 421, a scan control unit 422, a cooperativeservice request unit 423, and an image processing unit 424.

The display control unit 421 displays user interface screens (UIscreens) for accepting various user operations on the touchscreen of theoperation unit 220. Examples of the various user operations includeentering login authentication information for accessing the MFPcooperative server 120, configuring the scan settings, issuing aninstruction to start a scan, setting a file name, issuing an instructionto save a file, and so on.

In response to an user operation performed on a UI screen (e.g.,pressing a “Start scan” button), the scan control unit 422 issues aninstruction to execute a scan process to the scan execution unit 411along with scan setting information. According to the instruction toexecute a scan process from the scan control unit 422, the scanexecution unit 411 causes the scanner unit 222 to execute a documentread process via the scanner I/F 217 to thereby generate scanned imagedata. The scanned image management unit 412 saves the generated scannedimage data to the HDD 214. At this time, the scan control unit 422 isnotified of information of a scanned image identifier uniquelyindicating the saved scanned image data. The scanned image identifier isa number, a symbol, alphabetical characters, or the like uniquelyidentifying the image scanned by the MFP 110. The scan control unit 422,for example, obtains scanned image data to be converted into a file fromthe scanned image management unit 412 by using its scanned imageidentifier mentioned above. Then, the scan control unit 422 instructsthe cooperative service request unit 423 to issue a request to the MFPcooperative server 120 for processes necessary for the conversion into afile.

The cooperative service request unit 423 issues requests to the MFPcooperative server 120 for various processes, and receives responses tothese requests. Examples of the various processes include loginauthentication, analysis of the scanned image, transmission of thescanned image data, and so on. A communication protocol such as REST orSOAP is used in the communication with the MFP cooperative server 120.The image processing unit 424 performs predetermined processes on thescanned image data to generate an image to be used in an UI screen to bedisplayed by the display control unit 421.

Note that an apparatus other than the MFP 110 (such as a client PC notillustrated) may have some of the functions implemented by theabove-described additional function module 420 and the like.Specifically, the system configuration may be such that the client PCrequests an analysis of a scanned image obtained by the MFP 110, sets afile name based on the analysis result, and performs other similaroperations.

Also, in the case where the client PC sets a file name and performsother similar operations, the configuration may be such that a program(module) for setting a file name or the like is installed in the clientPC in advance. However, the configuration is not limited to this. Forexample, the configuration may be such that a general-purpose webbrowser provided to the client PC is utilized to obtain a webapplication for setting a file name or the like from the MFP cooperativeserver 120 and execute it.

[Functional Configurations of Server Apparatuses]

First, the functional configuration of the MFP cooperative server 120will be described. The MFP cooperative server 120 has a request controlunit 431, an image processing unit 432, a storage server access unit433, a data management unit 434, and a display control unit 435. Therequest control unit 431 stands by in a state where it can receiverequests from external apparatuses, and instructs the image processingunit 432, the storage server access unit 433, and the data managementunit 434 to execute predetermined processes according to the contents ofa request received.

The image processing unit 432 performs, on scanned image data sent fromthe MFP 110, an analysis process such as a process of detectingcharacter regions, a character recognition process (OCR process), and aprocess of determining whether a similar document is present, and alsoan image modification process such as turning the scanned image data andcorrecting its tilt. Note that each character region detected from ascanned image will be hereinafter referred to as “text block”. Also, thefollowing description will be given using a form such as a quotation oran invoice as an example of the target document, and its scanned imagemay be referred to as “form image”.

The storage server access unit 433 issues a request to the storageserver 130 for processes. Cloud services have made various interfacespublicly available which are for saving files to a storage server andobtaining saved files by using a protocol such as REST or SOAP. Thestorage server access unit 433 issues requests to the storage server 130by using a publicly available interface. The data management unit 434holds and manages user information, image analysis results, variouspieces of setting data, and so on to be managed in the MFP cooperativeserver 120.

The display control unit 435 returns screen layout information (such asHTML and CSS) necessary to display a screen in response to receiving arequest from a web browser running on a PC or mobile terminal (neitherof which is illustrated) connected via the Internet. Via the screendisplayed on the web browser, the user can check registered userinformation and change the scan settings.

Next, the functional configuration of the storage server 130 will bedescribed. The storage server 130 has a request control unit 441, a filemanagement unit 442, and a display control unit 443. The request controlunit 441 stands by in a state where it can receive requests fromexternal apparatuses and, in the present embodiment, instructs the filemanagement unit 442 to save a received file or read out a saved file inresponse to a request from the MFP cooperative server 120. The requestcontrol unit 441 then returns a response corresponding to the request tothe MFP cooperative server 120. The display control unit 443 returnsscreen layout information (such as HTML and CSS) necessary to display ascreen in response to receiving a request from a web browser running ona PC or mobile terminal (neither of which is illustrated) connected viathe Internet. Via the screen displayed on the web browser, the user cancheck and obtain saved files.

[Entire Flow of Processing]

FIG. 5 is a sequence chart illustrating a flow of processing performedbetween the apparatuses in a case of scanning documents (forms) with theMFP 110, converting the obtained form images into files, and saving themto the storage server. FIG. 6 is a diagram illustrating an example of aUI screen of a main menu displayed upon activation of the MFP 110(hereinafter referred to as “main screen”). By installing a dedicatedapplication necessary for scanning a document, converting it into afile, and utilizing the cloud storage service into the MFP 110, a “Scanand save to cloud storage” button 601 is displayed in a main screen 600.Then, when the user presses the “Scan and save to cloud storage” button601 among the menu buttons displayed in the main screen 600, the seriesof processes illustrated in the sequence chart in FIG. 5 is started.Note that the sequence chart in FIG. 5 describes a case of setting filenames to be given to scanned images at the time of converting them intofiles. However, the sequence chart is applicable to general propertiessuch as folder paths and metadata, besides file names.

The communication between the apparatuses will be described below inchronological order along the sequence chart in FIG. 5. Note that thesymbols “S” in the following description represent steps.

The MFP 110 displays a UI screen for entering login authenticationinformation for accessing the MFP cooperative server 120 (hereinafterreferred to as “login screen”) on the operation unit 220 (S501). FIG. 7illustrates an example of the login screen. When the user enters apreregistered user ID and password respectively into entry fields 702and 703 in a login screen 700 and presses a “Login” button 701, a loginauthentication request is transmitted to the MFP cooperative server 120(S502).

In response to receiving the login authentication request, the MFPcooperative server 120 performs an authentication process by using theuser ID and password contained in the request (S503). In a case wherethe user is confirmed as a genuine user by the authentication process,the MFP cooperative server 120 returns an access token to the MFP 110.When various requests are subsequently issued from the MFP 110 to theMFP cooperative server 120, this access token is sent along with them sothat the logged-in user can be identified. In the present embodiment, bycompleting login to the MFP cooperative server 120, login to the storageserver 130 is also completed at the same time. Thus, the user links auser ID for utilizing the MFP cooperative service and a user ID forutilizing the storage service with each other via a web browser on a PC(not illustrated) or the like on the Internet in advance. In this way,if the authentication of login to the MFP cooperative server 120succeeds, the authentication of login to the storage server 130 is alsocompleted at the same time. This can omit an operation for logging in tothe storage server 130. The MFP cooperative server 120 can handlerequests related to the storage service from the user having logged into the MFP cooperative server 120. Note that, as for the loginauthentication method, a generally and publicly known method (such asBasic authentication, Digest authentication, or authentication usingOAuth) may be used to perform the login authentication.

After the login is completed, a UI screen for configuring the scansettings (hereinafter referred to as “scan setting screen”) is displayedon the operation unit 220 of the MFP 110 (S504). FIG. 8 illustrates anexample of the scan setting screen. A scan setting screen 800 includes a“Start scan” button 801, a color setting field 802, and a resolutionsetting field 803. The “Start scan” button 801 is a button for issuingan instruction to start a scan process on a document (assumed to be aform such as a quotation or an invoice in the present embodiment) set onthe platen glass. In the color setting field 802, the color mode forscanning is set. The color mode can be designated from among choicessuch as full color and black-and-white, for example. In the resolutionsetting field 803, the resolution for scanning is set. The resolutioncan be designated from among choices such as 600 dpi and 1200 dpi, forexample. Note that the color mode and the resolution are an example ofsetting items. Not all of them have to be present, and/or a setting itemother than them may be present. Also, the choices for the color mode andthe resolution may be limited only to setting values required by thestorage service. The logged-in user sets specific conditions for thescan process via the scan setting screen 800 as above. When thelogged-in user having finished configuring the scan settings sets ascan-target document on the platen glass of the MFP 110 and presses the“Start scan” button 801, a scan is executed (S505). Consequently,scanned image data as a computerized paper document is generated. Afterthe completion of the scan, the MFP 110 transmits the scanned image dataobtained by the scan to the MFP cooperative server 120 along with ananalysis request for the scanned image data (S506).

In the MFP cooperative server 120 having received the analysis requestfor the form image, the request control unit 431 instructs the imageprocessing unit 432 to execute an analysis process (S507). In doing so,the request control unit 431 returns a request ID which can uniquelyidentify the received analysis request to the MFP 110.

On the other hand, the image processing unit 432, in response toreceiving the instruction to execute the analysis process, executes theanalysis process on the form image associated with the analysis request(S508). In this analysis process, the image processing unit 432 firstlyperforms a process of detecting the text blocks present in theanalysis-target form image. For the text block detection, a known methodmay be employed such as a method in which rectangular regions assumed tobe characters are extracted from an image binarized based on a thresholdvalue, for example. Thereafter, the image processing unit 432 performs acharacter recognition process (OCR: Optical Character Recognition) oneach text block found by the text block detection process.

The image processing unit 432 then performs a process of determiningwhether the analysis-target form image is similar to a previouslycomputerized form image by matching the analysis-target form image withforms registered as learned data (form determination process). Aprevious form (image) similar to the analysis-target form image will bereferred to as the similar form.

This form determination process uses arrangement information indicatingwhere each text block present in the analysis-target form image islocated in this form image. Specifically, the arrangement information onthe current analysis-target form image and the arrangement informationon a previous form image registered in the learned data or the like arecompared with each other to determine whether or not their text blockarrangements are the same or similar. This is based on the idea thatforms can be assumed as forms of the same type created by using the samedocument format (form format) if their text block arrangements are thesame or similar.

The result obtained by this analysis process is passed to the requestcontrol unit 431. Note that in the present embodiment, whether the formsare similar or not is determined based solely on the degree of matchbetween their text block arrangements. However, for example, the typesof the forms (quotation, invoice, etc.) may be identified from their OCRresults and whether they are similar or not may be determined with theobtained type information taken into account. Note that details of thisstep will be described later with reference to FIG. 11.

While the above analysis process is performed, the MFP 110 inquires ofthe MFP cooperative server 120 about the processing status regularly(e.g., at intervals of about several hundred milliseconds to severalmilliseconds) by using the above-mentioned request ID (S509 to S509′).This inquiry is repeatedly made until a response indicating completionof the analysis process is obtained from the MFP cooperative server 120(S510). In response to receiving the inquiry about the processingstatus, the MFP cooperative server 120 checks the progress of theanalysis process associated with the request ID and, in a case where theprocess has not been completed, returns a response indicating that theprocess is still being performed.

The response contains “status” as an item indicating the currentprocessing status, and a character string held in “status” indicates thecurrent processing status. For example, in a case where the characterstring held in “status” is “processing”, it indicates that the processis being performed by the MFP cooperative server 120. In a case of“completed”, it indicates that the process has been completed. In a caseof “failed”, it indicates that the process has failed. In a case of“completed”, it indicates that the process has been completed. Notethat, the response contains analysis result information obtained byanalyzing the scanned image, scan setting information, and so on inaddition to “status”.

After receiving a process completion response, the MFP 110 issues arequest to the MFP cooperative server 120 for the result of the analysisprocess by using a URL indicating a storage destination for the imageanalysis result contained in the response (S511). In response toreceiving the request, the request control unit 431 of the MFPcooperative server 120 returns information on the result of the analysisprocess.

Thereafter, the series of processes from displaying the scan settingscreen to obtaining an image analysis result (S504 to S511) is repeatedas many times as the number of forms to be converted into a file.Alternatively, the plurality of forms may be scanned together in S505and, based on an analysis request for each form image (S506), ananalysis instruction (S507) may be issued and an analysis process (S508)may be performed on a form-by-form basis.

The MFP 110 requests an UI screen (storage save screen) on which tocheck a list of the scanned forms, set the file name of a form image,and designate a file to be saved to the storage server 130 (S512). TheMFP 110 displays the storage save screen on the touchscreen of theoperation unit 220 based on screen information generated according tocontrol by the display control unit 435 of the MFP cooperative server120 (S513). Details of the storage save screen will be described later.

The UI screen is displayed, for example, by a web browser installed inthe MFP 110. Using a URL for displaying the storage save screen, the MFP110 accesses this URL to obtain screen information for displaying thestorage save screen from the MFP cooperative server 120. The user cancheck a list of the files of the form images on the storage save screenwith the web browser of the MFP 110.

When the user selects the file of a form image on a scanned form list901 (see FIG. 9) in a storage save screen 900 and presses an “Edit”button 903 (see FIG. 9), the MFP 110 accepts these inputs. The MFP 110then displays a file name setting screen 1000 (see FIGS. 10A and 10B)for setting the file name of the selected form image. The file namesetting screen 1000 will be described later. Thereafter, when the userenters a file name on the file name setting screen 1000 and presses an“OK” button 1007 (see FIGS. 10A and 10B), the MFP 110 accepts theseinputs and returns the screen to the storage save screen 900.

When the user presses a “Send” button 902 (see FIG. 9) in the storagesave screen 900, the MFP 110 accepts this input. The MFP 110 transmits alearning request to the MFP cooperative server 120 (S514), the learningrequest containing information on the input operation performed by theuser at the time of setting a property such as the file name of thescanned image of the form selected by the user (setting-time inputinformation).

In the MFP cooperative server 120 having received the learning request,the request control unit 431 instructs the image processing unit 432 toexecute a learning process (S515). The image processing unit 432 savesthe arrangement information on each text block present in theanalysis-target form image and the information inputted at the time ofsetting its file name, which is contained in the learning requestreceived in S514 (information on the one or more text blocks used forthe file name), as learned data (S516). After the learning process iscompleted, the request control unit 431 returns a response indicatingcompletion of the process to the MFP 110.

The MFP 110 thereafter transmits a request to save the selected formimage to the MFP cooperative server 120 along with the data of this formimage and information on its file name and the like to be set at thetime of transmitting the file of the form image (S517). In the MFPcooperative server 120 having received the save request, the requestcontrol unit 431 starts a file generation process and returns a responseindicating that the save request has been normally received to the MFP110.

In response to receiving this response, the MFP 110 deletes the file ofthe transmitted form image from the list. The MFP 110 then ends itsprocessing and returns to the state of displaying the scan settingscreen (S504).

The MFP cooperative server 120, on the other hand, obtains informationon a file format from preregistered scan settings, and converts the formimage into a file by following this file format (S518). The form imagefile thus generated is transmitted and saved to the storage server 130(S519).

[Storage Save Screen]

FIG. 9 is a diagram illustrating an example of the storage save screen900 displayed on the touchscreen of the MFP 110 in S513. On the storagesave screen 900, the user can check a list of files of form imagesbefore being transmitted to the storage server 130 after being subjectedto the image analysis process.

In the storage save screen 900 in FIG. 9, there are the scanned formlist 901, the “Send” button 902, the “Edit” button 903, and a “Delete”button 904. Further, there are group bars 908, group display switchbuttons 909, a list display order key selection list 910, and a listdisplay filter list 911.

The “Delete” button 904 is a button for deleting the file of a formimage. When the user selects the file of any form image from the scannedform list 901 and presses the “Delete” button 904, the file of theselected form image is deleted from the scanned form list 901.

The “Edit” button 903 is a button for switching to a UI screen forsetting the file name of the file of a form image. When the user selectsthe file of any form image from the storage save screen 900 and pressesthe “Edit” button 903, the MFP 110 accepts this instruction. Then, thedisplay control unit 421 of the MFP 110 issues a request to the MFPcooperative server 120 for the UI screen for setting the file name ofthe selected form image (hereinafter referred to as “file name settingscreen”) and displays the file name setting screen on the touchscreen.

FIGS. 10A and 10B are diagrams illustrating an example of the file namesetting screen 1000. When the user sets the file name on the file namesetting screen 1000 and presses the “OK” button 1007, the screendisplayed on the touchscreen returns to the storage save screen 900illustrated in FIG. 9 again. The file name setting screen 1000 will bedescribed later.

The “Send” button 902 is a button for transmitting the file data of aform image to the storage server 130. When the user selects the file ofany form image from the scanned form list 901 and presses the “Send”button 902, the file of the selected form image is transmitted to thestorage server 130. Note that in a case where the transmission of thefile is normally completed, the information of the single rowcorresponding to the file is deleted from the scanned form list 901.

In the scanned form list 901, information on the files of the formimages obtained by the scans performed this time is displayed in a listform such that each row displays a single file. The scanned form list901 has three items “File name”, “Destination”, “Scan date and time”.

The “File name” is a column that displays the file names of the formimages obtained by the scans performed this time. In a case where a formimage is of a learned form, the file name of this form image isgenerated based on learned input information in a form similar to thisform image and the character recognition result of the form image. In acase where a form image is of an unlearned form, a default file name isgenerated, such as the date and time at which it scan was executed.Then, the generated file name is displayed in the column “File name”.

Note that “learned” refers to a form for which it is determined in theform determination process that a form created with the same form format(similar form) is present in the learned data and information used toset a property of this similar form such as its file name (setting-timeinput information) has been saved. Also, “unlearned” refers to a formfor which setting-time input information has not been saved. That is, ifit is determined that no similar form is present in the formdetermination process, the analysis-target form image is of a new formand is therefore of an unlearned form.

Note that the default file name to be generated for an unlearned formmay be settable by the user before it is scanned. In the case where theuser enters a file name on the above-described file name setting screen1000, that name is displayed.

The “Destination” is an item that displays the name of the storageserver 130, to which the files are to be transmitted. The “Scan date andtime” is an item that displays the date and time at which the forms werescanned.

In the present embodiment, the files of the form images displayed in thescanned form list 901, which have been obtained by the scans performedthis time, can be displayed such that similar forms are arrangedtogether. That is, form images determined as being similar to the sameform in the similar form determination in S508 can be assumed as formsof the same type, and the files of this set of form images can bedisplayed together as one group.

Further, in the present embodiment, unlearned form images can also bedisplayed such that similar form images are displayed together. In thisway, even in a case where a plurality of new forms are scanned, the usercan easily find a desired file from the scanned form list 901, which canease the burden on the user. A method of determining a form similar toan unlearned form image will be described later with reference to FIG.11.

Each of the group bars 908 is a region for displaying a name identifyinga group of files of form images displayed together as the same group(group name). The group bars 908 are added for respective groups.

For example, for learned form images, the group name of a similar formsimilar to them is obtained and displayed as their group name. Forunlearned form images, a group name containing a character stringindicating “unlearned”, for example, is displayed. Meanwhile, thebackground color of the group bars 908 for groups of unlearned formimages may be changed to a different color from that of the group barsfor groups of learned form images. Alternatively, the shapes of thegroup bars may be changed such that the user can easily distinguishthem.

Each of the group display switch buttons 909 is a button for switchingbetween displaying and hiding the files of the individual form images inthe corresponding group. In a case where the group display switch button909 is pressed to set the hiding mode, the rows indicating the files ofthe corresponding individual form images are not displayed, asillustrated by “Unlearned03” in the scanned form list 901 in FIG. 9.This enables only the group bar 908 to be displayed, as illustrated by“Unlearned03” in the scanned form list 901 in FIG. 9.

The list display order key selection list 910 is a pull-down forselecting an item based on which to change the display order of thefiles of the form images displayed in the scanned form list 901.Examples of the items included in the list display order key selectionlist 910 include “Form name”, “Form name (descending order)”, “Fromunlearned”, and “From unlearned (descending order)”.

In a case where “Form name” is selected, the files are sorted such thatthe files of the learned form images are firstly displayed in ascendingorder of their file names and then the files of the unlearned formimages are displayed in ascending order of their file names. In a casewhere “Form name (descending order)” is selected, the files are sortedsuch that the files of the learned form images are firstly displayed indescending order of their file names and then the files of the unlearnedform images are displayed in descending order of their file names. In acase where “From unlearned” is selected, the files are sorted such thatthe files of the unlearned form images are firstly listed in ascendingorder of their file names and then the files of the learned form imagesare listed in ascending order of their file names. In a case where “Fromunlearned (descending order)” is selected, the files of the form imagesare sorted such that the files of the unlearned form images are firstlylisted in descending order of their file names and then the files of thelearned form images are listed in descending order of their file names.

Ascending order is, for example, an order of arrangement such as anumerical order from “1” to “9” and an alphabetical order from “A” to“Z”. Likewise, in the following, ascending order or descending ordermeans sorting based on the file names.

The list display filter list 911 is a pull-down for selecting an itembased on which to display only the files of particular form images inthe scanned form list 901. Examples of the item include “All”,“Learned”, “Unlearned”, and “File name yet to be entered”.

“All” is an item for displaying the files of all displayable formimages. “Learned” is an item for displaying the files of only thelearned form images among the displayable form images. “Unlearned” is anitem for displaying the files of only the unlearned form images. “Filename yet to be entered” is an item for displaying the files of only theform images whose file names are default values.

For example, in FIG. 9, “All” is selected in the list display filterlist 911, so that the files of all form images are displayed. Byselecting, for example, “Learned” in the list display filter list 911,the files in the groups “Unlearned01”, “Unlearned02”, and “Unlearned03”can be hidden.

[Setting File Name]

The user can set the file name of a form image to be saved to thestorage server 130 with the file name setting screen 1000 illustrated inFIGS. 10A and 10B. The file name setting screen 1000 in FIG. 10A is adiagram illustrating an example of the screen with no text blockselected. A file name entry field 1001 is a region for displaying a filename set by the user. Also, touching the blank portion in the file nameentry field 1001 displays a software keyboard, with which the user canenter any characters. In a state where a character string to be the filename is entered and displayed, touching this character string displays asoftware keyboard for correcting the character string in the touchedportion, with which the entered and displayed characters can becorrected. A preview region 1002 is a region for displaying a preview ofthe form image.

Further, by touching a text block in the displayed preview image, theuser can add the character string in the text block at the touchedposition into the file name entry field 1001 as a character string thatforms the file name. The text block selected with a touch or the likemay be highlighted by adding a line, a frame, a color, or the like tothe text block or the like so that the user can notice that it has beenselected. Also, in a case where the user selects a plurality of textblocks, the colors of the text blocks may be set to different colors.Also, the display position and magnification of the preview may bechanged such that the selected text block appears at the center. Also,in a case where there are a plurality of text blocks, the displayposition of the preview may be calculated such that a preset number oftext blocks will be displayed. For example, the preview may be displayedwith its display position and magnification changed such that the centerbetween the uppermost region and the lowermost region among regions usedfor the file name appears at the vertical center of the preview region.Touching a once selected text block again unselects the text block,deletes the corresponding character string, and brings the text blockback to the state where the line, color, or the like added to it is notdisplayed.

In the file name setting screen 1000 in FIG. 10A, no text block isdisplayed in the preview image. However, text blocks may be displayed ina visually recognizable manner by using a color or frames in order toindicate the selectable portions to the user. Meanwhile, a button or thelike may be used to switch between enabling and disabling the visuallyrecognizable display of the text blocks. Performing a swipe operation onthe preview image can move the position of the preview image displayedin the preview region 1002.

A deletion button 1003 is a button for deleting the characterscorresponding to the trailing text block among the characters displayedin the file name entry field 1001. An enlargement button 1004 is abutton for increasing the display magnification of the preview imagedisplayed in the preview region 1002. A reduction button 1005 is abutton for reducing the display magnification of the preview imagedisplayed in the preview region 1002. At the time of enlargement andreduction, the display position is adjusted such that the coordinates ofthe center of the preview region 1002 remains the same as before theenlargement and reduction. An initial display button 1006 is a buttonfor setting the display magnification and the display position of thepreview image back to their initial states in a case where the displayposition has been changed by a swipe operation and/or the displaymagnification has been changed with the enlargement button 1004 or thereduction button 1005. An OK button 1007 is a button for bringing the UIscreen back to the above-described storage save screen 900 illustratedin FIG. 9 at the point when the file name is set, in order to transmitthe form image data to the MFP cooperative server 120 along with the setfile name.

FIG. 10B is a diagram illustrating the file name setting screen 1000 ina case where a text block detected as “Quotation” by characterrecognition is firstly selected and a text block detected as“ShimomarukoCorporation” by character recognition is subsequentlyselected. In the file name entry field 1001, two character stringscorresponding to the text blocks have been entered in the order in whichthey were touched, and an underscore has been entered as a separatorbetween the two character strings. Note that the underscore may beentered manually by the user and entered automatically.

[Details of Scanned Image Analysis Process]

Next, details of the analysis process (S508) performed by the MFPcooperative server 120 in the present embodiment will be specificallydescribed along a flowchart in FIG. 11. Specifically, the CPU of the MFPcooperative server 120 executes the series of processes illustrated inthe flowchart in FIG. 11 by loading program code stored in the ROM intothe RAM and executing it. Also, the functions of some or all of thesteps in FIG. 11 may be implemented with hardware such as an ASIC or anelectronic circuit. Meanwhile, the symbol “S” in the description of eachprocess means a step in the flowchart. From among the form imagesobtained by the scans performed this time, one analysis-target formimage is selected, and the processes in the flowchart in FIG. 11 areperformed on the analysis-target form image.

In S1101, the image processing unit 432 obtains the analysis-target formimage associated with the analysis request. The image processing unit432 then detects the text blocks present in the analysis-target formimage. As a result, the arrangement and sizes of the text blockscontained in the analysis-target form image are identified on a percharacter string basis.

In S1102, the image processing unit 432 performs a character recognitionprocess on all text blocks detected by analyzing the form image inS1101.

In S1103, the image processing unit 432 executes the form determinationprocess described earlier on the analysis-target form image. In S1104,whether a similar form having a document format matching the documentformat of the analysis-target form image is found from the learned datais determined based on the result of the process in S1103. If a similarform is found (YES in S1104), then in S1105, the image processing unit432 generates analysis result information indicating the result of theanalysis.

FIGS. 12A and 12B are diagrams illustrating examples of the datastructure of the analysis result information on the form image generatedin S1105 or S1107 to be described later. FIG. 12A illustrates an exampleof the data structure of the analysis result information in a case wherea form in the learned data determined as being similar to theanalysis-target form image is a learned form for which setting-timeinput information has been saved.

If the analysis-target form image is of a learned form, the imageprocessing unit 432 obtains the text blocks among all text blocksdetected from the analysis-target form image which correspond to thetext blocks registered as the setting-time input information in thesimilar form. In the present embodiment, the text blocks registered asthe setting-time input information are text blocks containing thecharacter strings used in the file name of the similar form. The imageprocessing unit 432 then generates analysis result information based onthe text blocks in the analysis-target form image corresponding to theregistered text blocks in the similar form.

In the analysis result information in FIG. 12A generated in this step,“matched” stores a value based on the result of the form determinationprocess, indicating whether a similar form similar to theanalysis-target form image has been found from the learned data.“matched” holds “true” if a similar form is present, and holds “false”if no similar form is present.

“formId” is an item holding a value serving as an identifier uniquelyindicating the type of the form. If a similar form is present, “formId”holds the same value as the value given to this similar form.

“matchingScore” stores a value indicating the degree of similaritybetween the analysis-target form image and the similar form. The valueheld in “matchingScore” is a real number value from 0 to 1 representingthe degree of match between the arrangement information on the textblocks in the similar form and the arrangement information on the textblocks in the analysis-target form image. Here, a larger value indicatesthat the analysis-target form image is an image of a form more similarto the similar form.

“rectInfoArray” holds information indicating the text blocks in theanalysis-target form image corresponding to the text blocks in thesimilar form containing the character strings designated by the user toset the file name of the similar form. “rectInfoArray” contains one ormore sets of pieces of information “key”, “region”, “rect”, and “text”.

“key” holds a unique value identifying a text block to be used for thefile name. “region” holds information on the text block identified bythe corresponding “key”. Further, in “region”, there are “rect” as anitem holding coordinate information on the text block and “text” as anitem holding the character string recognized from the text block. In“rect”, “x” represents the X coordinate of the upper left corner of thetext block, “y” represents the Y coordinate of the upper left corner,“width” represents the number of pixels in the X direction, and “height”represents the number of pixels in the Y direction. “text” is an itemholding the character string obtained by performing an OCR process onthe text block indicated by “rect”.

“metadataArray” is an item holding information indicating the order ofthe text blocks to be used for the file name and a delimitertherebetween for automatic entry of the file name. “metadataArray”contains pieces of information “key”, “keyType”, and “value”.

“key” stores a value uniquely indicating a setting item to be set forthe form image. “keyType” stores a value indicating the type of thesetting value in “key”. “key” holds “filename” and “keyType” holds“filename” in a case of using the information held in “rectInfoArray”for the file name.

“value” holds information on the order of the character strings to beused and a character that separates the character strings for generationof a setting value based on the value of “key”. The example of FIG. 12Aindicates generation of the setting value (file name) by using theregion with “key” holding “fileRegion0” in “rectInfoArray”, a delimiter,and the region with “key” holding “fileRegion1” in “rectInfoArray” inthis order. More specifically, the example of FIG. 12A indicatesgeneration of a file name “Quotation ShimomarukoCorporation”.

Also, in a case where other properties than the file name such asmetadata are set, necessary information is added to “rectInfoArray” and“metadataArray”.

If, on the other hand, no similar form is present (NO in S1104), then inS1106, the image processing unit 432 instructs the data management unit434 to register information on the arrangement of the text blocks in theanalysis-target form image and the like in the learned data as thedocument format of a new form.

Specifically, at the point of this step, setting-time input informationon the new form such as its file name has not been set by the user, butat least the arrangement information on the text blocks in theanalysis-target form image is registered in the learned data asinformation on the new form. In other words, the arrangement informationon the text blocks in an unlearned form is registered in the learneddata.

Among the plurality of scanned images obtained by the scans performedthis time, the next analysis-target scanned image will be matched withthe learned forms registered in the learned data before the execution ofits scan and also will be matched with the new form registered in S1106to determine whether the next analysis-target scanned image is similarto any one of them.

In a case where a property such as the file name is set by the userafter the end of this flowchart, the information on the text blocksregistered in S1106 is registered in the learned data as a learned formalong with this setting-time input information.

In S1107, the image processing unit 432 generates analysis resultinformation on the analysis-target form image. FIG. 12B is a diagramillustrating the analysis result information in the case where theanalysis-target form image is an unlearned form. As illustrated in FIG.12B, if no similar form is present, the analysis result informationincludes only the items “matched” and “formId”. Note that as for thevalue of “formId”, a new value is given as a new form, and “formId”holds that value.

In S1108, the image processing unit 432 stores the analysis resultinformation generated in S1105 or S1107 in the HDD of the MFPcooperative server 120, and ends the processing in this flowchart. Afterthe end of the processing in the flowchart in FIG. 11, the processing iscontrolled such that an unanalyzed form image is selected as theanalysis-target form image from among the form images obtained by thescans performed this time and this flowchart is repeated until there isno unanalyzed form image.

In a case where the next analysis-target form image is similar to thenew form registered in the process in S1106, the image processing unit432 determines in S1104 that a similar form is present (YES in S1104),and proceeds to S1105. In this case, since the new form is an “unlearnedform” for which no setting-time input information has been saved, theanalysis-target form image is an “unlearned form image” even if theresult of the determination in S1104 is YES. Thus, in the process inS1105 in the case where the analysis-target form image is determined asbeing similar to the new form, the analysis result information isgenerated with no information stored in or after “rectInfoArray” in theanalysis result information in FIG. 12A. Note that as for the value tobe held in “formId”, the same value of “formId” as that given to thesimilar form image is given.

The pieces of analysis result information thus generated will be used tocreate the storage save screen 900 in S513. With these pieces ofanalysis result information, the files of form images having the samevalue of “formId” are grouped and displayed together. In particular, inthe present embodiment, in a case where an unlearned form image forwhich no setting-time input information has been saved is similar toanother unlearned form image, “formId” in its analysis resultinformation holds the same value as that of the similar unlearned formimage.

For example, for the form images with the file names “201911201100(1)”and “201911201100(3)” in the scanned form list 901 in FIG. 9, the samevalue of “formId” is given in the respective pieces of analysis resultinformation as a result of the flowchart in FIG. 11. Thus, images of thesame document format can be displayed together in the scanned form list901 even if they are of unlearned forms. Hence, even in a case where aplurality of new forms are scanned, the user can check the files oftheir form images by type.

A group name corresponding to the value held in “formId” is stored, forexample, in the HDD of the MFP cooperative server 120. This group nameassociated with the value held in “formId” is obtained and displayed asthe group name in the corresponding group bar 908 in the storage savescreen 900.

Each group name may be generated based on, for example, the file namesof the corresponding form images generated by combining the characterstrings in text blocks in them. Each group name may also be editable bythe user. For example, as for the group name of a form registered as anew form in S1106, a group name formed of a character string indicatingthat the form is an unlearned form and a serial number is generated andstored in the HDD of the MFP cooperative server 120 in association withthe value of “formId”.

In the case where a group name is generated or edited, the group nameassociated with the value of “formId” is also updated. For example, inS514, the one or more character strings forming the group name and theposition information on the corresponding text blocks are also sent tothe MFP cooperative server 120 and, in S516, the group name is updatedand saved. Note that the analysis result information may be providedwith a region to hold a group name and a group name may be held in theanalysis result information. Here, methods of determining and updating agroup name will be described in detail in later-described Embodiments 2and 3.

As described above, according to the present embodiment, the files ofform images obtained by scanning unlearned new forms for which nosetting-time input information has been saved can also be displayed suchthat similar forms are grouped together. In this way, even in the casewhere a plurality of new forms are scanned, the user can easily find thefile of a desired new form image. Thus, the burden on the user can beeased.

Note that while the storage save screen 900 and the file name settingscreen 1000 have been described as screens to be displayed on thetouchscreen of the MFP 110, they may be displayed on a display unit ofanother information processing apparatus in the image processing system100 such as the client PC. In this case, the client PC issues a requestto the MFP cooperative server 120 for screen information for displayingthe storage save screen 900 and the file name setting screen 1000. Byusing the pieces of analysis result information, the MFP cooperativeserver 120 returns the client PC screen information on the storage savescreen 900 for displaying the respective form images such that thosewith the same value of “formId” are displayed together. In this casetoo, the user can check the files of the form images by form type on theweb browser of the client PC.

Embodiment 2

In Embodiment 2, a description will be given of a method in which ananalysis-target form image and a learned form determined as beingsimilar to the analysis-target form image are compared with each otherto update the group name registered in association with the value oftheir “formId” to a more appropriate name. In the present embodiment,its difference from Embodiment 1 will be mainly described. Features thatare not particularly specified are the same components and processes asthose in Embodiment 1.

FIGS. 13A and 13B are diagrams illustrating an example of a “group namelist” in a table form, the “group name list” holding group namesassociated with values of “formId”. The “group name list” is stored inthe HDD of the MFP cooperative server 120.

FIG. 13A illustrates an example of the group name list stored beforescans are executed this time. The group name list will be described withreference to FIG. 13A. The group name list holds information on groupnames associated with respective rows (records), i.e., respective valuesof “formId”. The group names held by form type in the group name listhave been registered based on the analysis results or learning resultsobtained when previous forms were scanned. The registration of a newform in the group name list will be described in Embodiment 3.

Next, the items in the group name list (the columns in the table) willbe described. “groupNo” holds serial numbers in a database. “formId”holds values uniquely indicating form types, as explained in FIGS. 12Aand 12B.

“formName” holds group names associated with the respective form types,i.e., the respective values of “formId”. “fileRegion0” to “fileRegionN”each hold part of the character strings used in the group names. Eachgroup name has been generated using part of the file name of a formimage. “fileRegion0” to “fileRegionN” in the group name list hold thecharacter strings in “text” of “fileRegion0” to “fileRegionN” containedin the analysis result information explained in FIGS. 12A and 12B, whichindicate the character strings in the text blocks to be used to generatea file name. Note that, of “fileRegion0” to “fileRegionN” in theanalysis result information, only the information in one or more“fileRegion”s with the character strings used in the group name istransferred to the corresponding one or ones of “fileRegion0” to“fileRegionN” in the group name list. That is, not all character stringsforming the file name are necessarily used in the group name. Thus, ofthe character strings in “fileRegion0” to “fileRegionN” in the analysisresult information, only the one or more character strings used in thegroup name are held. Note that the information on the positions of thetext blocks from which the character strings held in “fileRegion0” to“fileRegionN” are recognized is also held but is omitted.

The user can modify each group name, and “userText” holds any characterstring designated by the user and accepted by the MFP cooperative server120. In a case where duplicate group names are generated, “duplicateNum”holds serial numbers to be added to these group names so that the groupnames can be distinguished from each other.

Each group name held in “formName” is formed of the character stringsheld in “fileRegion0” to “fileRegionN”, “userText”, and “duplicateNum”and a separator(s). As the separator, the same character as thedelimiter used in file names may be used, or a different character maybe set. The present embodiment will be described on the assumption thatthe separator is (underscore). “formNameArray” holds the order ofarrangement of the character strings in each group name, which is formedof the character strings in “fileRegion0” to “fileRegionN”, “userText”,and “duplicateNum” and the separator(s).

In a case where the analysis-target form image in the flowchart in FIG.11 is of a learned form, the group name associated with its value of“formId” is obtained from the group name list. In the presentembodiment, however, the currently stored group name is not used as is.In the present embodiment, the information of the group name list andthe character strings in the analysis-target form image are comparedwith each other, and the group name currently held in the group namelist is updated to an appropriate name.

FIG. 14 is a flowchart for explaining a process which, in a case wherethe form images obtained by the scans performed this time include alearned form image, updates a group name with this learned form image asthe process target. This flowchart is executed after the analysisprocess illustrated in FIG. 11 is performed on all form images obtainedby the scans performed this time.

In S1401, from the group name list saved in the HDD of the MFPcooperative server 120, the image processing unit 432 obtains theinformation on the group name associated with the value of “formId” inthe analysis result information on the process-target form image.

Assume, for example, that the value of “formId” of a similar form hasbeen obtained in S1105, and “aaaaaaaa-ffff-49ab-acf8-55558888eeee” hasbeen held as the value of “formId” in the analysis result information ofthe process-target form image. In this case, in the current group namelist in FIG. 13A, the information of the record with “groupNo” of 1, inwhich “formId” is “aaaaaaaa-ffff-49ab-acfl3-55558888eeee”, is obtainedas the information on the group name. Note that, in FIG. 13A, “InvoiceShimomarukoCorporation” is held as the group name associated with“formId” of “aaaaaaaa-ffff-49ab-acf8-55558888eeee”. In the following,this flowchart will be described on the assumption that the informationon the group name of the similar form similar to the process-target formimage is the information of the record with “groupNo” of 1.

In S1402, in a case where character strings in the similar form are usedin the group name obtained in S1401, the image processing unit 432obtains the position information on the text blocks of these characterstrings. The image processing unit 432 then identifies the text blockspresent at the same positions as the obtained text block positions amongthe text blocks in the process-target form image, and obtains thecharacter strings contained in these text blocks. Here, being the sameis not limited to being exactly the same. Corresponding text blockspresent at nearby positions may be included as the text blocks at thesame positions.

For example, in the group name list in FIG. 13A at the start of thescans, the record with “groupNo” of 1 holds character strings in“fileRegion0” and “fileRegion1”. The group name is therefore formedusing the character strings held in the text blocks indicated by“fileRegion0” and “fileRegion1”. Thus, the position information on thetext blocks at “fileRegion0” and “fileRegion1” in the record with“groupNo” of 1 is obtained, and the character strings in the text blocksin the process-target form image at the same positions as these textblocks are obtained.

Note that in a case where the form images obtained by the scansperformed this time include a plurality of form images determined asbeing similar to the same form in the learned data, the characterstrings are similarly obtained from the other form image(s) as well.

In S1403, the image processing unit 432 compares the character stringsused in the group name in the group name list and the character stringsobtained in S1402 with each other.

FIG. 15 is a diagram in which the character strings forming the groupname held in the current group name list and the character stringsobtained in S1402 are organized in a table form for the purpose ofcomparison. The record with “No” of 1 in the table in FIG. 15 indicatesthe character strings used in the group name in the record with“groupNo” of 1 in the group name list. In the record with “groupNo” of1, the character string in “fileRegion0” is “Invoice” and the characterstring in “fileRegion1” is “ShimomarukoCorporation”. The record with“No” of 2 in the table in FIG. 15 indicates the character strings in thetext blocks which correspond to “fileRegion0” and “fileRegion1” andobtained from the process-target form image. In the process-target formimage, the character string in “fileRegion0” is “Quotation”, and thecharacter string in “fileRegion1” is “ShimomarukoCorporation”.

As illustrated in FIG. 15, the character strings in “fileRegion1” arethe same whereas the character strings in “fileRegion0” are “Invoice” inthe registered group name and “Quotation” in the process-target formimage. As described above, comparing the character strings in theprocess-target form image and those in the currently registered groupname with each other may indicate that text blocks at the same positioncontain different character strings. Then, if the registered group nameis used as is as the group name of the process-target form image and thegroup name is displayed with “Invoice” included therein, a group namewith a character string not contained in the form image is displayed.For the user, this is not preferable for finding the file of the formimage. Hence, in the present embodiment, the previously generated groupname is not used as is but the group name is updated.

In S1404, the image processing unit 432 generates a group namerepresenting the type of the process-target form image by using amatching character string among the character strings forming thecurrent group name and the character strings obtained in S1402 based onthe result of the comparison in S1403. The non-matching characterstrings are not used in the group name. In the example of FIG. 15, theimage processing unit 432 generates the group name by using thecharacter strings in “fileRegion1”, which match each other between bothforms. For example, the image processing unit 432 sets the characterstring “ShimomarukoCorporation” in “fileRegion1” as the group name. Theimage processing unit 432 then updates the group name held in “formName”in the record with “groupNo” of 1 in the group name list to“ShimomarukoCorporation”. As a result, the group name list is now in astate where the updated group name “ShimomarukoCorporation” is held in“formName” in the record with “groupNo” of 1, as illustrated in FIG.13B.

The group name updated in S1404 will be used to create the screeninformation of the scanned form list 901 in S512. Specifically, thegroup names registered in the updated group name list will be displayedin the group bars 908 in the scanned form list 901.

As described above, according to the present embodiment, a characterstring common to similar forms is used as their group name among thecharacteristic character strings used in their file names or the like.This enables generation of a characteristic group name common to allsimilar forms. Thus, even in a case where the number of forms similar toforms in the learned data increases, it is possible to reduce the burdenof an operation in which the user him- or herself performs maintenanceon characteristic group names so as to make the groups distinguishable.It is therefore possible to save the user's effort and time for findinga desired form even in a case where a plurality of form image groups arepresent in a mixed manner in a list.

Embodiment 3

Embodiment 1 has been described such that the group name of the files ofunlearned form images is generated using a character string indicatingthat the forms are unlearned and a serial number, like “Unlearned01”.However, in a case where the form images obtained by the scans performedthis time include many unlearned form images, it will be difficult forthe user to find the file of a form image. In view of this, inEmbodiment 3, a description will be given of a method of determining anappropriate group name for the files of unlearned form images anddisplaying the files under the group name. In the present embodiment,its difference from Embodiment 2 will be mainly described. Features thatare not particularly specified are the same components and processes asthose in Embodiment 2.

[Process of Generating Group Name of Unlearned Form]

FIG. 16 is a flowchart for determining the group name of an unlearnedform among the form images obtained by the scans performed this time.This flowchart is executed after the analysis process illustrated inFIG. 11 is performed on all forms scanned this time. Also, thisflowchart will be described on the assumption that one of any form imagefor which it has been determined that no similar form is present inS1104 in FIG. 11 is the process target. In a case where there are aplurality of form images for which it has been determined that nosimilar form is present, the processing in this flowchart is performedon each of them as the process targets.

In S1601, the image processing unit 432 generates a name combining acharacter string indicating that the form is an unlearned form, and anumeric character. The numeric character is a serial number that isincremented each time a group name is generated for an unlearned file.For example, for the first form image to be processed in this flowchart,a name “Unlearned01” is generated.

Note that the name to be generated is not limited to a name such as“Unlearned01”. Instead of the character string “Unlearned”, anothercharacter string indicating that information for setting a property hasnot been saved (such as “New”) may be used. Also, instead of the numericcharacter representing a serial number, a character for distinguishingthe group name from others (such as an alphabetic character) may beused. This flowchart will be described on the assumption that“Unlearned01” is generated in S1601.

In S1602, the image processing unit 432 obtains the character stringsused in the group names registered in the group name list stored in theHDD of the MFP cooperative server 120.

Assume that FIG. 13B is the group name list stored at the start of thisflowchart. In this case, the character strings held in “fileRegion0” to“fileRegionN” in the group name list in FIG. 13B are obtained in thisstep. The character strings used in the group names before the start ofthis flowchart are “Quotation”, “ShimomarukoCorporation”,“KawasakiCorporation”, “ShinagawaCorporation”, “TokyoCorporation”, and“Invoice”, excluding the duplicate character strings. Thus, thesecharacter strings are obtained in this step. That is, the characterstrings in the group names of the registered learned forms are obtained.

In S1603, the image processing unit 432 searches the process-target formimage for text blocks containing any of the character strings obtainedin S1602. The result of an OCR process on each form image has beenstored as “character recognition result”. Thus, the image processingunit 432 searches the character recognition result of the process-targetform image for the target character strings to search for text blockswith any of them.

FIGS. 18A and 18B are diagrams illustrating form images scanned thistime. FIG. 18A illustrates the process target unlearned form image inthis flowchart. FIG. 18B is a diagram illustrating another form imagesimilar to the form image in FIG. 18A, which will be described later.

FIGS. 19A and 19B are diagrams illustrating, in a table form, thecharacter recognition results obtained by performing an OCR process onthe form images. The character recognition result in FIG. 19A is anexample of the character recognition result obtained by performing anOCR process on the form image in FIG. 18A. The character recognitionresult in FIG. 19B is an example of the character recognition resultobtained by performing an OCR process on the form image in FIG. 18B. Thecharacter recognition results have been saved in, for example, the HDDof the MFP cooperative server 120.

The character recognition results in FIGS. 19A and 19B will now bedescribed. The records in the character recognition results (the rows inthe tables) hold a value or text data of each of items “rectNo”, “x”,“y”, “width”, “height”, and “text”, and a record is generated for eachtext block. “rectNo” is a column holding serial numbers of the textblocks.

“x” is a column holding the x coordinates of the upper left corners ofthe text blocks, “y” is a column holding the y coordinates of the upperleft corners of the text blocks, “width” is a column holding the numbersof pixels in the text blocks in the X direction, and “height” is acolumn holding the numbers of pixels in the text blocks in the Ydirection. “text” is a column holding the character strings contained inthese text blocks. In the present embodiment, for the coordinates ineach image, a coordinate system is used in which the origin is the upperleft corner and the vertical direction and the horizontal directionextend in the Y direction and the X direction respectively, for example.

Assume that the character strings obtained in S1602, which are used inthe previous group names, are “Quotation”, “ShimomarukoCorporation”,“KawasakiCorporation”, “ShinagawaCorporation”, “TokyoCorporation”, and“Invoice”. Assume also that the character recognition result of theprocess-target form image is FIG. 19A. In this case, in S1603, the imageprocessing unit 432 sets the character strings in the column “text” inFIG. 19A as the search range. Then, the image processing unit 432searches the search range for any of the character strings “Quotation”,“ShimomarukoCorporation”, “KawasakiCorporation”, “ShinagawaCorporation”,“TokyoCorporation”, and “Invoice” to search for text blocks with any ofthem. By the process in this step, the text block with “rectNo” of 1,which contains “Quotation”, is obtained from the character recognitionresult in FIG. 19A as the search result. Similarly, the text block with“rectNo” of 4, which contains “QuotationNo:”, and the text block with“rectNo” of 11, which contains “KawasakiCorporation”, are obtained asthe search result.

In a case where the same character string is present in a plurality oftext blocks, the text block with the higher or highest degree of matchwith the search value is kept as the search result. For example,“Quotation” in “rectNo” of 1 and “QuotationNo:” in “rectNo” of 4 arecharacter strings both containing “Quotation”, but the character stringin the text block with “rectNo” of 1 has a higher degree of match with“Quotation”. For this reason, the text block with “rectNo” of 4 isexcluded from the search result.

In S1604, the image processing unit 432 determines whether one or moretext blocks containing any of the character strings used in the groupnames are present in the character recognition result of theprocess-target form image, based on the result of the search in S1603.If one or more text blocks are present (YES in S1604), the imageprocessing unit 432 proceeds to S1605.

In S1605, the image processing unit 432 obtains the character stringscontained in the text blocks obtained by the search in S1603.Specifically, of the character strings used in the other group names,the character strings also contained in the process-target unlearnedform image are obtained by this step. For example, in the case of thecharacter recognition result in FIG. 19A, the character strings“Quotation” and “KawasakiCorporation” are obtained from the text blockswith “rectNo” of 1 and 11, respectively.

In S1606, the image processing unit 432 determines whether an unlearnedform image similar to the process-target form image in this flowchart ispresent among the form images obtained by the scans performed this time.Specifically, the process-target form image in this flowchart is a formimage for which it has been determined that a form similar to it is notpresent in the learned data, but has been registered in the learned dataas a new form in S1106. For this reason, in a case where a form imagebeing an unlearned form image obtained by a scan performed this time issimilar to the process-target form image in this flowchart, “formId” inits analysis result information has been given the same value as thevalue of “formId” of the process-target form image. Thus, it isdetermined that an unlearned form image similar to the process-targetform image in this flowchart is present if there is a form image holdingthe same value of “formId”, for example.

FIG. 18B illustrates an example of a form image among the form images ofa plurality of unlearned forms obtained by scans performed this timewhich has been determined as being similar to the form image in FIG.18A, which is the process target in this flowchart.

If a similar unlearned form is present (YES in S1606), then in S1607,the image processing unit 432 obtains the character strings in the textblocks in the character recognition result of the similar unlearned formimage that are present at the same positions as the text blocks obtainedby the search in S1603. Here, being the same is not limited to beingexactly the same. Corresponding text blocks such as ones present atnearby positions may be included as the text blocks at the samepositions.

FIG. 19B illustrates a character recognition result obtained byperforming an OCR process on the unlearned form image in FIG. 18Bsimilar to the process-target form image. In S1607, from FIG. 19B, thetext block with “rectNo” of 1 in FIG. 19B is identified, which ispresent at a position corresponding to, that is, at the same position asor a position near, the text block with “rectNo” of 1 in FIG. 19Aobtained by S1603. Moreover, the text block with “rectNo” of 11 in FIG.19B is identified, which is present at a position corresponding to thetext block with “rectNo” of 11 in FIG. 19A. Then, from the characterrecognition result in FIG. 19B, the character strings “Quotation” and“KawasakiCorporation” are obtained from the text blocks with “rectNo” of1 and 11, respectively.

Note that in a case where there are a plurality of similar unlearnedform images, character strings are obtained from these form images in asimilar manner. Then, the image processing unit 432 compares thecharacter strings obtained in S1607 and the character strings obtainedin S1605 with each other.

In S1608, the image processing unit 432 determines whether one or morematching character strings are present based on the result of thecomparison in S1607 between the character strings obtained from theprocess-target form image and the character strings obtained from theform image similar to the process-target form image. If one or morematching character strings are present (YES in S1608), the imageprocessing unit 432 proceeds to S1609. For example, the characterstrings in “rectNo” of 1 and 11 in FIG. 19B are “Quotation” and“KawasakiCorporation”, respectively. In the example of FIGS. 19A and19B, both “Quotation” and “KawasakiCorporation” are the same characterstrings as the character strings obtained from the process-target formimage. Thus, the image processing unit 432 determines that matchingcharacter strings are present.

In S1609, the image processing unit 432 adds the character stringsdetermined as matching character strings by the comparison in S1607 tothe name generated in S1601 to thereby generate a group name. Note thatthe non-matching character strings are not used in the group name. Inthe case of adding a new character string to a name, providing adelimiter makes the boundary between the elements noticeable. Thus, forexample, the character strings may be added with (underscore) insertedbetween the character strings.

For example, if the result of the comparison in S1607 indicates that thecharacter strings common to the plurality of forms are “Quotation” and“KawasakiCorporation”, then, the group name generated in S1609 is“Unlearned01_Quotation_KawasakiCorporation”.

On the other hand, if another unlearned form image similar to theprocess-target form image is not present (NO in S1606), then in S1610,the image processing unit 432 adds the character strings obtained inS1605 to the name generated in S1601 to thereby generate a name.

In S1611, the image processing unit 432 stores the generated group nameof the new form in the group name list saved in the HDD of the MFPcooperative server 120. Specifically, the generated group name is storedin the group name list in association with the value of “formId” of theprocess-target form image.

FIGS. 17A and 17B are diagrams illustrating the group name list in theform of a table. FIG. 17A illustrates the group name list after thegroup name of the unlearned form image is stored. Assume that the valueof “formId” of the process-target form image in this flowchart is“fffffff-kkkk-81ab-acf9-77772222jjjj”. Assume also that the group namegenerated by the processing in this flowchart is“Unlearned01_Quotation_KawasakiCorporation”. In this case, the generatedgroup name is stored in association with the value of “formId”, asindicated by the record with “groupNo” of 6 in FIG. 17A.

Meanwhile, there is a case where it is determined in S1604 that thecharacter recognition result of the process-target form image includesnone of the text blocks containing in the character strings used in thegroup names (NO in S1604). In this case, the name generated in S1601 isstored as the group name in association with the value of “formId” ofthe process-target form image. Specifically, “Unlearned01” generated inS1601 is stored in the group name list in association with the value of“formId” of the process-target form image.

Likewise, in a case where no matching character string is present (NO inS1608), the name generated in S1601 is also stored as the group name inassociation with the value of “formId” of the process-target form image.

[Process of Updating Group Name of Unlearned Form]

In a case where the user enters a file name for the unlearned form imageafter the end of the flowchart in FIG. 16, the group name generated inthe flowchart in FIG. 16 is updated according to this entered file name.

FIG. 20 is a flowchart of a process of updating a group name after fileinformation is entered. The processing in this flowchart is executed ona form image, as the process target, whose file name is entered by theuser.

In S2001, the image processing unit 432 obtains the character stringsused in the file name of the process-target form image and, if thesecharacter strings are character strings selected from among thecharacter strings in the process-target form image, obtains the positioninformation on the text blocks containing these character strings.

FIG. 21 illustrates the analysis result information on the unlearnedform image updated after the user has entered information for generatingits file name (file information). In FIG. 21, the region at and below“rectInfoArray” holds data in the same structure as that in the analysisresult information in FIG. 12A. The example of FIG. 21 indicates thatthe user has generated a file name using the character string in“fileRegion0”, a delimiter, the character string in “fileRegion1”, adelimiter, and the character string in “fileRegion2” in this order.Thus, from the updated analysis result information, the characterstrings in the file name entered by the user and their text blocks areobtained.

Specifically, in the case of FIG. 21, the position information on thetext block “fileRegion0” and the character string “Quotation” containedin this text block are obtained by S2001. Similarly, the positioninformation on the text block “fileRegion1” and the character string“KawasakiCorporation” therein, and the position information on the textblock “fileRegion2” and the character string “M5432” therein areobtained.

Note that in the case where file information is entered, the enteredfile information may be stored as data separate from the analysis resultinformation.

In S2002, the image processing unit 432 determines whether an unlearnedform image similar to the process-target form image in this flowchart ispresent among the form images obtained by the scans performed this time.

The method of making the determination is similar to that in S1606, inwhich, for example, it is determined that an unlearned form imagesimilar to the process-target form image in this flowchart is present ifthere is a form image holding the same value of “formId” as the value of“formId” of the process-target form image.

If a similar unlearned form is present (YES in S2002), then in S2003,the image processing unit 432 obtains the character strings contained inthe text blocks in the other form image similar to the process-targetform image that are present at the same positions as the text blocksobtained in S2001. Since the character recognition results obtained byperforming an OCR process on the form images have been stored, the imageprocessing unit 432 searches the character recognition results for thetext blocks and obtains the character strings therein. Here, being thesame is not limited to being exactly the same. Corresponding text blockspresent at nearby positions may be included as the text blocks at thesame positions. Note that in a case where there are a plurality ofsimilar unlearned forms, character strings are obtained from theseunlearned forms in a similar manner.

Then, among the character strings obtained in S2003 and the characterstrings obtained in S2001, the image processing unit 432 compares thecharacter strings in the corresponding text blocks present at the sameor nearby positions with each other.

FIG. 22 is a diagram in which the character strings in each form areorganized in a table form for the purpose of comparison between thecharacter strings obtained in S2001 and the character strings obtainedin S2003. The record with “No” of 1 in the table in FIG. 22 indicatesthe character strings obtained in S2001. The record with “No” of 2 inthe table indicates the character strings obtained in S2003. Asillustrated in FIG. 22, the character strings in “fileRegion0” and“fileRegion1” in the two forms match each other whereas the characterstrings in “fileRegion2” in the two forms differ from each other.

In S2004, the image processing unit 432 generates a group namerepresenting the type of the process-target form image by using thematching character strings among the character strings in theprocess-target form image and the character strings in the form imagesimilar to the process-target form image based on the result of thecomparison in S2003. In the example of FIG. 22, the image processingunit 432 generates the group name by using the character string“Quotation” in “fileRegion0” and the character string“KawasakiCorporation” in “fileRegion1”, which match each other betweenboth forms. The character strings may be added with (underscore)inserted between the character strings. For example, a group name suchas “Quotation_KawasakiCorporation” is generated.

If no similar unlearned form is present (NO in S2002), then in S2005,the image processing unit 432 generates the group name by using thecharacter strings used in the file name.

In 52006, the image processing unit 432 updates the group name liststored in the HDD of the MFP cooperative server 120 based on thegenerated group name. FIG. 17B illustrates an example of the group namelist updated by the processing in this flowchart. Assume that the valueof “formId” of the process-target form image in this flowchart is“fffffff-kkkk-Blab-acP9-77772222jjjj”, and the group name generated bythe processing in this flowchart is “Quotation_KawasakiCorporation”. Inthis case, the group name list is updated such that the generated groupname is held in “formName”, as indicated by the record with “groupNo” of6 in FIG. 17B.

The group name list updated by the processing in the flowcharts in FIGS.16 and 20 will be used to create the screen information of the storagesave screen 900. Thus, the group names registered in the updated groupname list can be displayed in the group bars 908 in the scanned formlist 901.

As described above, according to the present embodiment, the files ofform images obtained by scanning unlearned new forms for which nosetting-time input information has been saved can also be displayed suchthat the form images are grouped together with a group name indicatingtheir features displayed. In this way, it is possible to ease the burdenon the user for entering a group name and performing other similaroperations. Also, even in a case where a plurality of new forms arescanned, the user can easily find the file of a desired new form image.Thus, the burden on the user can be eased.

OTHER EMBODIMENTS

In the above embodiments, the analysis process in FIG. 11 and the groupname generation or update processes in FIGS. 14, 16, and 20 have beendescribed as processes performed by the image processing unit 432 of theMFP cooperative server 120. However, they may be performed by the imageprocessing unit 424 of the MFP 110.

In the case where the image processing unit 432 generates a group name,the image processing unit 432 may perform a process of excluding acharacter string such as one indicating a date and a form-specificunique character string such as “M5432” from the group name to begenerated. For example, in the case of a form in Japanese, the imageprocessing unit 432 may perform a process by which a character stringformed solely of at least one of a numeric value, alphabetic characters,or symbols or a character string combining a date and a numeric value isnot used in the group name.

Also, in Embodiments 2 and 3, configurations in which the MFPcooperative server 120 generates a group name have been described.Additionally, the group names in the storage save screen 900 may beeditable by the user after the storage save screen 900 is displayed. Forexample, a user instruction to add any character string to a group name,a user instruction to add the character string in any text block to agroup name, or the like may be accepted.

Embodiment(s) of the present disclosure can also be realized by acomputer of a system or apparatus that reads out and executes computerexecutable instructions (e.g., one or more programs) recorded on astorage medium (which may also be referred to more fully as a‘non-transitory computer-readable storage medium’) to perform thefunctions of one or more of the above-described embodiment(s) and/orthat includes one or more circuits (e.g., application specificintegrated circuit (ASIC)) for performing the functions of one or moreof the above-described embodiment(s), and by a method performed by thecomputer of the system or apparatus by, for example, reading out andexecuting the computer executable instructions from the storage mediumto perform the functions of one or more of the above-describedembodiment(s) and/or controlling the one or more circuits to perform thefunctions of one or more of the above-described embodiment(s). Thecomputer may comprise one or more processors (e.g., central processingunit (CPU), micro processing unit (MPU)) and may include a network ofseparate computers or separate processors to read out and execute thecomputer executable instructions. The computer executable instructionsmay be provided to the computer, for example, from a network or thestorage medium. The storage medium may include, for example, one or moreof a hard disk, a random-access memory (RAM), a read only memory (ROM),a storage of distributed computing systems, an optical disk (such as acompact disc (CD), digital versatile disc (DVD), or Blu-ray Disc (BD)™),a flash memory device, a memory card, and the like.

While the present disclosure has been described with reference toexemplary embodiments, it is to be understood that the disclosure is notlimited to the disclosed exemplary embodiments. The scope of thefollowing claims is to be accorded the broadest interpretation so as toencompass all such modifications and equivalent structures andfunctions.

This application claims the benefit of Japanese Patent Application No.2020-089594 filed May 22, 2020, which is hereby incorporated byreference wherein in its entirety.

What is claimed is:
 1. An image processing apparatus comprising: adetermination unit configured to determine a similar form similar toscanned image for each of a plurality of scanned images; and a displaycontrol unit configured to perform control such that a list of theplurality of scanned images is displayed on a display unit, whereinscanned images of which the determined similar form are same aredisplayed together in the list.
 2. The image processing apparatusaccording to claim 1, wherein the determination unit is configured todetect a text block in each of the plurality of scanned images anddetermine a similar form similar to the scanned image based onarrangement of the detected text block.
 3. The image processingapparatus according to claim 2, further comprising a registration unitconfigured to register, as a new form, information on the arrangement ofthe text block in a scanned image for which the determination unitdetermines the similar form is not present.
 4. The image processingapparatus according to claim 3, wherein, in a case where an targetscanned image is registered as the new form by the registration unit,the determination unit is configured to determine a similar form similarto a scanned image to be targeted immediately after the target scannedimage among a form registered in advance and the new form registered bythe registration unit.
 5. The image processing apparatus according toclaim 1, wherein in a case where the determination unit determines thatthe similar form similar to an target scanned image, the determinationunit is configured to give the target scanned image the same identifieras an identifier given to the similar form, and the display control unitis configured to perform control such that scanned images given the sameidentifier among the plurality of scanned images are displayed together.6. The image processing apparatus according to claim 1, furthercomprising a character recognition unit configured to recognizecharacter strings contained in text blocks detected from the scannedimages.
 7. The image processing apparatus according to claim 6, furthercomprising: a first obtaining unit configured to, in a case where theplurality of scanned images include a first scanned image similar to afirst similar form for which information for setting a property has beensaved, obtain a character string forming a group name for the firstsimilar form; a second obtaining unit configured to, in a case where acharacter string contained in a text block in the first similar form isused in at least part of the group name for the first similar form,obtain a character string in the first scanned image that is present ina region corresponding to the text block in the first similar form; anda first update unit configured to update the group name for the firstsimilar form such that the group name is a group name generated toinclude a character string in matching portions of the character stringobtained by the first obtaining unit and the character string obtainedby the second obtaining unit and not to include character strings innon-matching portions of the character string obtained by the firstobtaining unit and the character string obtained by the second obtainingunit.
 8. The image processing apparatus according to claim 7, whereinthe display control unit is configured to display the group name updatedby the first update unit as a group name under which to display togetherscanned images similar to the first similar form.
 9. The imageprocessing apparatus according to claim 6, further comprising a thirdobtaining unit configured to, in a case where the plurality of scannedimages include an unlearned scanned image for which information forsetting a property has not been saved, obtain a character string forminga group name under which to display together learned scanned images forwhich information for setting a property has been saved; a fourthobtaining unit configured to obtain, from a second scanned image amongthe plurality of scanned images for which it is determined that thesimilar form is not present, information on a text block containing thecharacter string obtained by the third obtaining unit and a characterstring contained in the text block; a fifth obtaining unit configured toobtain a character string which is present in another unlearned scannedimage similar to the second scanned image and is contained in a textblock at a region corresponding to the text block obtained by the fourthobtaining unit; and a generation unit configured to generate a firstgroup name including a character string in matching portions of thecharacter string obtained by the fourth obtaining unit and the characterstring obtained by the fifth obtaining unit and not including characterstrings in non-matching portions of the character string obtained by thefourth obtaining unit and the character string obtained by the fifthobtaining unit, wherein the display control unit is configured todisplay the first group name as a group name under which to displaytogether the second scanned image and the other unlearned scanned imagesimilar to the second scanned image.
 10. The image processing apparatusaccording to claim 9, wherein the generation unit is configured togenerate the first group name such that the first group name includes acharacter string indicating that information for setting a property hasnot been saved.
 11. The image processing apparatus according to claim10, wherein in a case where the second scanned image does not includethe character string obtained by the third obtaining unit or thecharacter string in the matching portions is not present, the generationunit is configured to generate the first group name with the characterstring indicating that information for setting a property has not beensaved and a character string for distinguishing the first group namefrom another group name.
 12. The image processing apparatus according toclaim 9, further comprising a sixth obtaining unit configured to, in acase where a file name is set for the second scanned image and acharacter string contained in a text block in the second scanned imageis used in at least part of the set file name of the second scannedimage, obtain information on the text block and the character stringcontained in the text block used to set the file name; a seventhobtaining unit configured to obtain a character string which is presentin another unlearned scanned image similar to the second scanned imageamong the plurality of scanned images and is contained in a text blockat a region corresponding to the text block obtained by the sixthobtaining unit; and a second update unit configured to generate a secondgroup name and updates the first group name to the second group name,the second group name including a character string in matching portionsof the character string obtained by the sixth obtaining unit and thecharacter string obtained by the seventh obtaining unit and notincluding character strings in non-matching portions of the characterstring obtained by the sixth obtaining unit and the character stringobtained by the seventh obtaining unit.
 13. The image processingapparatus according to claim 7, wherein the group name is generated suchthat the character string forming the group name does not include acharacter string representing a date or a character string formed solelyof at least one of a numeric value, alphabetic characters, or symbols.14. The image processing apparatus according to claim 7, furthercomprising an acceptance unit configured to accept a user instruction toedit a generated group name.
 15. The image processing apparatusaccording to claim 1, wherein the display control unit is capable ofperforming control such that a file of a scanned image for whichinformation for setting a property has been saved and a file of ascanned image for which information for setting a property has not beensaved are displayed to be separate from each other.
 16. The imageprocessing apparatus according to claim 1, wherein the display controlunit is configured to perform control such that files of scanned imagesof which the similar forms are same are displayed together by using agroup bar displaying a group name.
 17. The image processing apparatusaccording to claim 16, wherein the display control unit is configured toperform control such that a group bar indicating a group of scannedimages for which information for setting a property has been saved and agroup bar indicating a group of scanned images for which information forsetting a property has not been saved differ from each other inbackground color.
 18. The image processing apparatus according to claim7, wherein the property includes at least a file name.
 19. An imageprocessing method comprising: determining a similar form similar toscanned image for each of a plurality of scanned images; and performingcontrol such that a list of the plurality of scanned images is displayedon a display unit, wherein scanned images of which the determinedsimilar form are same are displayed together in the list.
 20. Anon-transitory computer-readable storage medium storing a program whichcauses a computer to perform an image processing method comprising:determining a similar form similar to scanned image for each of aplurality of scanned images; and performing control such that a list ofthe plurality of scanned images is displayed on a display unit, whereinscanned images of which the determined similar form are same aredisplayed together in the list.